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1  INTRODUCTION 

The  first  reported  expert  systems  had  production-rule  knowledge  representation  schemes, 
examples  of  which  are  Mycin  [Shortliffe76],  Prospector  [Duda79]  and  R1  [McDermot82], 
Production-rules  are  conceptual  models  of  human  reasoning  which  have  many  undesirable 
features.  Even  so,  they  are  still  widely  used  by  expert  system  developers  and  form  the  basis  of 
the  majority  of  commercial  expert  system  shell  building  kits.  The  major  reasons  for  this 
loyalty  to  unstructured  production-rules  are  as  follows:  - 


1 )  Little  effort  is  required  in  understanding  their  basic  concepts. 

2 )  They  are  well  understood  by  knowledge  engineers. 

3)  It  is  easy  for  experts  to  reason  in  a  form  similar  to  production-rules, 
therefore  it  is  thought  that  knowledge  acquisition  is  made  simple. 

4)  Rapid  prototyping  is  achievable  using  simple  rule-based  systems. 

5)  The  addition  of  "new"  knowledge  and  the  removal  of  "old"  knowledge 
can  be  straightforward  using  a  rule-based  expert  system  (this  is  not 
necessarily  the  case). 

6)  Historically,  expert  systems  have  been  developed  by  these  conceptual 
models. 


Although  rule-based  expert  systems  may  seem,  at  first  glance,  to  be  the  idea!  approach,  they 
are  far  from  satisfactory.  As  a  result,  modelling  techniques  such  as  Semantic  Networks 
|Findler79),  Predicate  Calculus  !Bundy83),  Frames  [Winston84j  and  more  recently  n-Cube 
|Cosgrove90|  have  been  developed.  In  section  2,  a  simple  outline  of  production-rule 
representation  schemes  is  given.  Section  3  then  discusses  the  limitations  of  such  schemas  by 
analysis  of  a  simple  production-rule  knowledge-base.  Section  4  describes  the  n-Cube 
knowledge  representation  scheme,  after  which  section  5  shows  how  the  the  n-Cube  s 
conceptual  model  and  search  mechanisms  can  eliminate  the  disadvantages  associated  with 
production-rule  schemas. 
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2  RULE-BASED  KNOWLEDGE  REPRESENTATION  SCHEMES 

Production-rules  are  a  model  of  a  domain  expert's  reasoning  process.  A  simple  example  of 
such  a  rule  is  as  follows:- 


IF  XI  AND  X2  THEN  Y. 

Here  XI  and  X2  are  the  antecedents  (known  facts  or  knowledge  that  have  to  be  validated)  and 
Y  a  suggestion,  hypothesis,  consequence,  action  or  conclusion.  The  relationships  between  a 
number  of  production-rules  are  usually  understood  by  analysis  of  AND/OR  trees  [Winston79], 
a  simple  example  of  which  is  shown  in  figure  1.  The  semantics  of  this  tree  are  as  follows:- 


c  is  true  if  and  only  if: 
a  is  true  if  and  only  if: 

or 

or 


f  and  g  are  true, 
b  and  c  are  true 
d  is  true 
e  is  true. 


a 


Figure  1.  A  simple  AND/OR  graph 

This  graphical  description  of  the  rule  structure  assists  in  knowledge-base  analysis  and  design. 
For  instance,  high  quality  graphical  tree  representations  are  generated  in  commercial  expert 
system  shell  development  tools  such  as  NEXPERT  [Neuron88J.  This  graphical  approach 
simplifies  knowledge-base  development,  unfortunately,  it  is  impractical  for  complex 
interrelated  rule  structures  consisting  of  either  basic  propositional  logic  or  hybrid  rule/fact 
schemas. 
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2.1  Propositional  Logic 

The  simplest  form  of  production-rules  are  based  upon  propositional  logic  [Luger89],  for 
example  the  rules  which  model  the  propositional  sequence  of  months  of  the  year  are  as 
follows:- 

If  the  current  month  is  January  Then  the  next  month  is  February. 

If  the  current  month  is  February  Then  the  next  month  is  March. 

etc.  (  12  rules  in  total). 

This  static  schema  is  inflexible  therefore  knowledge-base  restructuring  can  be  time 
consuming.  In  addition,  propositional  logic  has  expensive  memory  storage  requirements 
which  may  lead  to  poor  system  performance. 

2.2  Hybrid  Rule/Fact  Schemas 

Hybrid  rule/fact  relationships  (also  known  as  Predicate  Calculus  )  have  been 
developed  [Bundy83]  to  overcome  the  disadvantages  of  rule-based  propositional 
schemas.  Hence,  the  above  propositional  knowledge-base  can  be  modelled  as  follows:- 

If  the  current  month  is  MONTH1 

And  MONTH2  follows  MONTH  1 

Then  the  next  month  is  MONTH2. 


(1  rule). 


February  follows  January. 

March  follows  February. 

etc.  (12  facts  in  total). 

This  schema  is  more  versatile  than  propositional  logic,  even  so  it  has  a  number  of 
disadvantages  in  common  with  propositional  logic  rule-based  systems.  These 
undesirable  features  are  discussed  in  section  3. 


3  THE  LIMITATIONS  OF  RULE  BASED  KNOWLEDGE 
REPRESENTATION  SCHEMES 

Although  rule-based  expert  systems  can  be  created  rapidly,  they  have  a  number  of 
undesirable  features.  To  highlight  these  a  simple  and  incomplete  expert  system,  shown  in 
figure  2,  will  be  analysed.  This  has  been  used  as  a  tutorial  [Minasi90a,  Minasi90b]  but  it  does 
not  investigate  any  of  the  downfalls  associated  with  such  schemas.  The  tutorial  rule-based 
expert  system  was  designed  by  an  experienced  knowledge  engineer;  even  so  it  has  many 
undesirable  features.  Hence,  if  a  novice  attempted  to  construct  such  a  rule-based  system,  then 
there  is  a  high  probability  that  the  performance  of  the  completed  system  would  be  far  from 
satisfactory. 
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Rl. 

If 

has  green  leaves 

and 

has  yellow  flowers 

and 

is  a  herb 

then 

is  a  daffodil. 

R2. 

If 

has  green  leaves 

and 

has  no  flowers 

and 

is  a  tree 

and 

is  evergreen 

then 

is  a  douglas  fir. 

R3. 

If 

is  a  mushroom 

then 

is  a  jack  o'lantem. 

R4. 

If 

has  green  leaves 

and 

has  no  flowers 

and 

is  seaweed 

then 

is  a  sea  lettuce. 

R5. 

If 

has  green  leaves 

and 

has  no  flowers 

and 

is  a  herb 

and 

is  land  based 

then 

is  a  lettuce. 

R6. 

If 

has  green  leaves 

and 

is  a  tree 

and 

is  land  based 

and 

is  deciduous 

then 

is  a  pin  oak. 

R  7. 

If 

height  is  greater  than  5  feet 

and 

stem  is  woody 

then 

is  a  tree. 

R8. 

If 

stem  is  not  woody 

then 

is  a  herb. 

R9. 

If 

is  a  tree 

and 

has  needles 

then 

is  evergreen 

RIO. 

If 

is  a  tree 

and 

has  green  leaves 

then 

is  deciduous. 

R 11. 

If 

produces  spores 

and 

is  not  green 

and 

is  land  based 

then 

is  a  mushroom. 

is  not  land  based 
then  is  a  seaweed 


Figure  2.  A  Simple  Plant  Identifier  Rule-Base 
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3.1  Failure  due  to  Inconsistency 

Analysis  of  the  tutorial  expert  system,  shown  in  figure  2,  identifies  that  rule  R2  and 
rule  R9  contradict  each  other.  For  example,  rule  R2  can  only  succeed  if  "is  evergreen"  is 
true.  Furthermore,  "is  evergreen "  is  true  if  and  only  if  rule  R9  is  true.  This  requires  has 
needles"  to  be  true.  Hence,  rule  R2  can  only  succeed  if  and  only  if  "has  needles"  and  "has 
green  leaves"  are  true.  This  can  never  occur,  consequently  rule  R2  can  never  succeed. 

The  unstructured  nature  of  simple  rule-based  schemas  facilitates  ad-hoc:  addition  and 
deletion  of  rules,  addition  and  deletion  of  antecedents,  and  modifications  to 
hypotheses.  As  a  result,  the  probability  of  failure  due  to  inconsistency  increases  with 
rule  complexity  and  knowledge-base  size. 

3.2  Duplication 

A  second  undesirable  feature  of  rule-based  expert  systems  is  duplication.  For  example, 
the  rules  that  describe  the  plant  "douglas  fir’can  be  displayed  in  the  form  of  an  A.\D 
graph  as  shown  in  figure  3.  This  graph  shows  that  "douglas  fir"  is  true  if  all  the 
children  in  the  tree  are  true.  By  inspection  ,  the  fact  "is  a  tree"  occurs  twice  which 
indicates  that  there  is  duplication  in  the  rule  structure.  In  addition,  the  mutually 
exclusive  pair  ("has  green  leaves"  and  "has  needles")  emphasise  the  inconsistency- 
discussed  above. 


Douglas  Fir 


has  green  leaves 


is  evergreen 


is  a  tree  has  no  flowers 


is  a  tree 


has  needles 

is  taller 

stem  is 

than  5 
feet 

woody 

Figure  3.  AND  Graph  of  the  ru’es  associated  with  Douglas  Fir 


Duplication  leads  to  vague  explanations  of  conclusions.  Furthermore,  it  causes 
unnecessary  questioning  which  in  turn  can  lead  to  the  searching  of  irrelevant  rules. 
Consequently,  this  undesirable  rule-based  schema  feature  affects  both  system 
performance  and  user  confidence. 
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3.3  Incorrect  Results 

Another  disadvantage  of  simple  rule-based  expert  systems  is  that  they  may  suggest 
incorrect  results  or  search  along  incorrect  paths.  In  so  doing,  silly  or  irrelevant  questions 
may  be  asked.  For  example,  if  the  plant  to  be  identified  was  a  sea  lettuce  and  rule  R8 
was  investigated  before  rule  R4,  then  plant  type  herb  would  become  true  and  the  system 
could  investigate  sub  classes  of  herb.  This  is  obviously  the  wrong  search  space  to  focus 
upon,  but  because  the  fact  "is  land  based"  has  been  overlooked  in  rule  R8  the  system 
deduces  that  the  plan  is  a  herb.  Hence,  the  rule  firing  and  rule  interactions  must  be 
analvsed  to  ensure  that  the  system  s  performance  is  not  susceptible  to  incorrect  results. 
Unfortunately  this  is  an  iterative  and  time  consuming  process  which  in  turn  could  lead 
to  further  undesirable  features. 

3.4  Updating  Difficulties 

i  he  final  drawback  of  simple  rule-based  expert  systems  is  due  to  the  difficulties  in 
upgrading  or  adding  new  knowledge.  For  instance,  if  the  "is  a  mint"  rule  (shown  below! 
was  added  to  the  knowledge-base,  then  a  detailed  analysis  of  the  existing  rules  must  be 
undertaken  so  that  conflicts  and  inconsistencies  do  not  occur.  If  this  is  not  accomplished, 
subtle  errors  that  are  difficult  to  detect  and  resolve  may  arise  and  therefore 
drastically  reduce  the  system's  performance. 


R  13. 

If 

is  a  herb 

and 

has  no  flowers 

and 

has  has  small  leav 

then 

is  a  mint. 

The  problem  with  adding  a  '-ule,  such  as  rule  R13,  is  that  a  conflict  arises  between  this 
new  rule  and  rule  R4  This  is  due  to  the  knowledge  engineer  overlooking  the  fact  that 
is  land  based  does  not  occur  in  the  rule  which  defines  herb.  Even  if  this  was  not 
overlooked  then  rule  R5  would  have  to  be  revised  for  consistency.  This  revised  rule 
requires  at  least  one  additional  antecedent  as  shown  by  the  following  rule:- 


R3. 


1  f  has  green  leaves 

and  does  not  ha're  small  leaves 

and  has  no  flowers 

and  is  a  herb 

and  is  land  based 

then  is  a  lettuce. 
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4  THE  n-CUBE  KNOWLEDGE  REPRESENTATION  SCHEME 


To  overcome  the  difficulties  assoc.ated  with  simple  production-rule  knowledge-based 
systems,  a  number  of  structured,  flexible  schemas  have  been  devised.  This  section  discusses 
the  n-Cube  knowledge  representation  scheme  and  shows  how  it  can  overcome  the 
disadvantages  of  production-rule  systems  applied  to  classification  problems. 

The  n-Cube  architecture  is  basically  a  classification  decision  tree  consisting  of  a  Parent 
Class  (P)  which  is  the  root  of  the  tree,  a  set  of  Sub-Classes  C  )  which  are  the  tree  nudes,  a  set 
of  Hypotheses  (H)  which  are  the  leaves  of  the  tree,  an  associated  Rule  Set  (R)  that  locally 
identifies  the  conditions  required  to  validate  either  P  or  S  or  H,  additional  Inheritance  (I) 
default  information  defined  bv  True  (T)  statements  and  False  (F)  statements.  The  structure  of 
this  hierarchical  architecture,  shown  in  figure  4  ,  is  defined  below. 

The  n-Cube's  root  node,  in  which  R  =s  P=>  I,  is  known  as  level  1  The  n-Cube's  lee t  nodes  in 
which  R  =»  H  =>  I  is  known  as  level  m.  Furthermore,  assume  every  branch  in  the  n-Cube  has 
an  equal  depth  :f  m.  Thus,  each  node  in  the  n-Cube's  structure  is  represented  by  a  quadruple  of 
the  form  :- 

(dt,  R,  T,  F)  where  (X  can  be  either:  P,  S  or  H. 

Then  by  induction,  the  structure  can  be  defined  as  follows:  - 

Level  1  is  the  set  {(P,  0,  T,  6)}  containing  a  single  quadruple,  where  (!)  is  the  empty  sc ,. 

Level  n  is  the  set  {(Sn,i,  Rn,i,  Tn,i,  Pn.i)}  1  <  n  <  m  ,  i  =  0 k,  where  for  each  i 

ve'1'  is  some  j  such  that  Sn,i  =  {S  €  Sn-l,j ;  Rr>  i}  . 

Level  m  is  the  set  {(Hm,i,  Rm,i,  Trrui,  Fm,i)}  >  =  0 . k,  where  for  each  i  there  is 

some  |  such  that  Hm,i  =  {S  e  Sm-l,j ;  Rm,i}  . 


R  =>  P  =7  I 


f  igure  4.  The  structure  of  the  n-Cube  knowledge  representation  scheme 
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The  n-Cube  syntax  used  to  model  the  above  structure  is  based  upon  a  3-tuple  as  follows:- 

Level  1  consists  of:  rule(P,[S2,l,  S2/2],[])  is  true  if  R1  is  true. 

Level  2  consists  of:  rule(S2,  1,  [H3,l,  H3,  2),  [P])  is  true  if  R2,l  is  true. 

rule(S2,  2,  [H3,3,  H3,  4],  [P])  is  true  if  R2,2  is  true. 

Level  3  consists  of:  rule(H3,l,  [],[P,S2,1])  is  true  if  R3,l  is  true. 

ruleO-13,2,  [],[P/S2,1])  is  true  if  R3,2  is  true. 

rule(H3,3,  []/[P/S2,2])  is  true  if  R3,13is  true. 

rule(H3,4,  [],[P;S2,2])  is  true  if  R3,4  is  true. 

In  addition,  each  of  the  above  3-tuples  has  an  associated  I  list  of  T  and  F  statements  so  that 
further  knowledge  can  be  stored  at  a  specific  n-Cube  node. 


4.1  Elimination  of  Failure  due  to  Inconsistency 

The  n-Cube  schema  allows  the  knowledge  engineer  to  decompose  a  domain  of  expertise 
into  small  hierarchical  partitions.  This  can  assist  in  the  simplification  of  the  complex 
task  of  knowledge  acquisition  [Welbank83,  Cosgrove91]  and  enables  high  quality 
explanations  to  be  generated.  For  example,  the  n-Cube  representation  of  the  plant 
domain  tutorial  expert  system,  as  shown  in  figure  3,  hierarchically  models  this  domain 
of  expertise.  Deciduous"  is  defined  by  its  parent  classes  "tree",  "land  based"  and 
plant  domain  ;  to  distinguish  "deciduous  ’  from  "evergreen"  a  minimal  rule  set  is  used 
such  that  has  needles"  identifies  "evergreen"  and  "has  green  leaves"  identifies 
deciduous".  As  a  result,  inconsistencies  can  be  identified  by  inspection  of  the  rules  in 
each  P  to  H  path. 


4.2  Elimination  of  Duplication 

Duplication  seldom  occurs  in  the  n-Cube  schema.  The  reasons  for  this  are  twofold. 
First,  the  minimum  set  of  R  associated  with  each  P,  S  and  H  is  used.  Once  P  has  been 
identified  by  this  minimum  rule  set,  then  any  Sub-Classes  of  S  have  only  to  be 
identified  by  a  minimum  set  of  rules  that  distinguish  each  Sub-Class  of  S.  The  second 
reason  why  the  n-Cube  schema  is  capable  of  eliminating  duplication  is  because  of  its 
simple  structure,  therefore  the  knowledge  engineer  can  rapidly  identify  duplicated 
rule  antecedents. 
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4.3  Elimination  of  Incorrect  Results 

In  the  n-Cube  representation  of  the  plant  domain,  shown  in  figure  5,  a  new  category 
known  as  "land  based"  is  added  to  the  knowledge-base.  This  new  category  is  mutually 
exclusive  with  "seaweed",  therefore  both  appear  at  the  same  level  in  the  n-Cube's 
hierarchical  representation.  To  ensure  that  only  one  of  these  categories  succeeds  in  a 
knowledge-base  search,  the  rules  associated  with  these  two  classes  are  also  mutually 
exclusive  (rules  Cubel  and  Cube2).  This  allows  specific  incorrect  results,  that  are 
inherent  in  standard  rule-based  systems,  to  be  eliminated.  Hence,  these  explicit 
hierarchical  relationships  simplify  knowledge-base  updating  procedures  and  reduce 
the  probability  of  conflict. 


4.4  Enhanced  Updating 

The  n-Cubes  structured  format  allows  the  simple  addition  of  new  concepts  and  rules.  For 
instance,  if  a  new  plant  "mint"  was  to  be  added  to  the  n-Cube's  knowledge-base,  shown 
in  figure  6,  then  the  knowledge-engineer  knows  that  "mint"  is  a  herb".  This  would 
result  in  the  addition  of  a  3-tuple  to  the  knowledge-base  as  follows  :- 


rule(mint,[nil], [plant  domain,land  based,herb]) 

is  true  if  has  small  jagged  green  leaves 
and  has  no  flowers. 


Consequently,  the  rules  which  focus  the  n-Cube  search  to  the  category  herb"  remain 
constant,  but  the  rules  associated  with  the  Sub-Classes  of  "herb"  are  investigated  so 
that  mint",  "lettuce"  and  "daffodil"  can  be  uniquely  identified. 
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pin  oak 


X/ 

douglas  fir 


Figure  5.  The  n-Cube  hierarchical  description  of  the  tutorial  knowledge-base 
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Cube  1. 

ruledand  baseditree^ushroonvherbUplant  domain])  is  true  if  is  land  based. 

Cube  2. 

rule(seaweed,[sea  lettuceUplant  domain])  is  true  if  is  not  land  based. 

Cube  3. 

rule(tree,[pin  oak,  dougias  fir], [plant  domain,  land  based])  is  true  if  height  >  5  feet 

and  stem  is  woody. 

Cube  4. 

rule(mushroom,]jack  o'lantern],[plant  domain,  land  based])  is  true  if  produces  spores 

and  is  not  green. 

Cube  5. 

rule(herb,[lettuce,daffodil], [plant  domain,  land  based])  is  true  if  stem  is  not  woody- 
Cube  6. 

ruletsea  lettuce,[nil|, [plant  domain,seaweed])  is  true  if  has  green  leaves 

and  has  no  flowers. 

Cube  7. 

rule(deciduous,(pin  oak], [plant  domain,land  based,  tree])  is  true  if  has  green  leaves. 
Cube  8. 

rule(evergreen,[douglas  fir], [plant  domain, land  based, tree])  is  true  if  has  needles. 
Cube  9. 

rule(lettuce,(rul], [plant  domain,Iand  based,herb])  is  true  if  has  green  leaves 

and  has  no  flowers. 

Cube  10. 

rulefdaffodil, [nil], [plant  domain, land  based, herb])  is  true  if  has  green  leaves 

and  has  yellow  flowers. 

Cube  11. 

ruletpin  oak,[nil|, [plant  domain,land  based, tree,deciduous)  is  true  \ 

Cube  12. 

ruleldouglas  fir,[nil],[plant  domain,land  based, tree,evergreen))  is  true  *. 


*  Note  Cube  21  and  Cube  12  have  no  associated  rules  because  they  are  the  only 
Hypotheses  that  can  be  derived  from  their  specific  parents  S. 


Figure  6.  The  Rules  associated  with  the  n-Cube  Schema 
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5  CONCLUSION 

Production-rule  representation  schemes  are  .  Ideiy  used  in  numerous  expert  system 
applications.  The  reasons  for  this  are  based  upon  evolution  and  the  belief  that  such  schemes 
are  simple  to  understand  and  implement.  Unfortunately,  production-rule  expert  systems 
facilitate  unstructured  addition,  deletion  and  modifications  of  rules.  Consequently,  such 
schemas  are  fraught  with  many  undesirable  features  that  include  failure,  duplication, 
incorrect  results  and  knowledge-base  updating  complications.  These  features  can  be 
eliminated  in  the  n-Cube  schema  by  structured  hierarchical  decomposition  and  highly 
specific  localised  rule  sets.  Furthermore,  the  n-Cube  ''an  assist  in  knowledge  acquisition  by 
highlighting  the  deficiencies  in  a  knowiedge-base  and  may  also  enhance  the  quality  of  any 
derived  explanations.  As  such,  the  n-Cube  schema  is  a  powerful  tool  that  can  be  used  to 
debug,  develop  and  implement  expert  system  applications. 
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